Detection of Text with Connected Component Clustering

Author

  • Shahul Hammed
Abstract

Text detection and recognition is an active research topic in image processing. It has drawn the attention of the content-based image retrieval (CBIR) community as a way to bridge the semantic gap between low-level and high-level features. Several methods have been developed for text detection and extraction that achieve reasonable accuracy for natural scene text (camera images) as well as multi-oriented text. However, most of these methods rely on a classifier and a large number of training samples to improve text detection accuracy. The multi-orientation problem can be addressed using connected component analysis. We extract connected components (CCs) from images using the maximally stable extremal region (MSER) algorithm. The extracted CCs are partitioned into clusters to generate candidate regions. We train an AdaBoost classifier that determines the adjacency relationship between CCs, and cluster the CCs using their pairwise relations. The scale, skew, and color of each candidate are estimated from its CCs, and we develop a text/non-text classifier for the normalized images. This classifier is based on multilayer perceptrons, and recall and precision rates can be controlled with a single free parameter. Finally, we extend our approach to exploit multichannel information. Experimental results on the ICDAR 2005 and 2011 Robust Reading Competition datasets show that our method yields state-of-the-art performance in both speed and accuracy.

Keywords: Text detection, Content-based image retrieval, Connected component based approach, CC clustering, machine learning classifier, non-text filtering, scene text detection.
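The clustering step described in the abstract, where CCs are grouped through their pairwise adjacency relations, can be illustrated with a minimal union-find sketch. This is not the paper's implementation: the `Component` record, the `adjacent` heuristic, and its `max_gap` and `max_height_ratio` parameters are hypothetical stand-ins for the MSER output and the trained AdaBoost pairwise classifier, which the abstract does not specify in detail.

```python
import math
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Component:
    """Hypothetical summary of one connected component (not from the paper)."""
    x: float       # centre x coordinate
    y: float       # centre y coordinate
    height: float  # bounding-box height


def adjacent(a, b, max_gap=2.0, max_height_ratio=2.0):
    """Assumed stand-in for the trained pairwise adjacency classifier.

    Two components count as adjacent when their heights are similar and
    their centres lie within max_gap times the larger height.
    """
    h_small, h_large = sorted((a.height, b.height))
    if h_large > max_height_ratio * h_small:
        return False
    distance = math.hypot(a.x - b.x, a.y - b.y)
    return distance <= max_gap * h_large


def cluster_components(components, is_adjacent=adjacent):
    """Group component indices into clusters by merging every adjacent pair."""
    parent = list(range(len(components)))

    def find(i):
        # Walk to the root, halving the path along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(components)), 2):
        if is_adjacent(components[i], components[j]):
            parent[find(i)] = find(j)

    clusters = {}
    for i in range(len(components)):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())
```

Because merging is transitive, two characters at opposite ends of a word end up in the same cluster even when they are not directly adjacent. Any other pairwise decision function, including a learned one, can be passed in through `is_adjacent`.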

Related resources

Arbitrarily Oriented Scene Text Detection using SMSER and Connected component analysis

In this work, a rotation-invariant approach is explored and an effective rotation-invariant text detection system is proposed. The discrete wavelet transform is used for multi-level feature extraction of the text region, since the vertical, horizontal and diagonal coefficients capture the variation in the edge pixels of the text scene image. Furthermore, detailed and approximation c...

A comprehensive of transforms, Gabor filter and k-means clustering for text detection in images and video

Keywords: Wavelet transform; Multilingual text; Wavelet decomposition; Gabor filter; k-Means clustering; Linked list approach; Wavelet entropy. Abstract: This paper presents an efficient approach to multilingual text detection for video indexing. We propose a method for detecting text located in varying and complex backgrounds in images and video. The present approach compri...

Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform

In this paper, we propose a text detection method based on a feature vector generated from connected components produced via the stroke width transform. Several properties, such as variant directionality of gradient of text edges, high contrast with background, and geometric properties of text components jointly with the properties found by the stroke width transform are considered in the forma...

Natural scene text localization using edge color signature

Localizing text regions in images taken from natural scenes is a challenging problem due to variations in the font, size, color and orientation of text. In this paper, we introduce a new concept, the so-called Edge Color Signature, for localizing text regions in an image. This method is able to localize both Farsi and English texts. In the proposed method, first a pyramid using diff...

A robust hybrid method for text detection in natural scenes by learning-based partial differential equations

Learning-based partial differential equations (PDEs), which combine fundamental differential invariants into a non-linear regressor, have been successfully applied to several computer vision tasks. In this paper, we present a robust hybrid method that uses learning-based PDEs for detecting texts from natural scene images. Our method consists of both top-down and bottom-up processing, which are ...

Publication date: 2014